Domain Specific Ontology Extractor For Indian Languages

Authors

  • Brijesh Bhatt
  • Pushpak Bhattacharyya
Abstract

We present a k-partite graph learning algorithm for ontology extraction from unstructured text. The algorithm divides the initial set of terms into partitions based on the information content of the terms and then constructs the ontology by detecting subsumption relations between terms in different partitions. This approach not only reduces the amount of computation required for ontology construction but also provides an additional level of term filtering. Experiments are conducted for Hindi and English, and performance is evaluated by comparing the resulting ontology with a manually constructed ontology for the Health domain. We observe that our approach significantly improves precision. The proposed approach does not require sophisticated NLP tools such as NER or a parser and can be easily adopted for any language.
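The two stages described in the abstract (partitioning terms by information content, then testing subsumption only across partitions) can be illustrated with a minimal sketch. The abstract does not say how information content is computed or how subsumption is detected, so everything specific here is an assumption made for illustration: information content is taken as the negative log of a term's document frequency, subsumption is approximated by a simple co-occurrence test, and the function names, the `threshold` parameter, and the equal-size partitioning are hypothetical. It is not the authors' implementation.

```python
import math
from itertools import combinations


def information_content(term, docs):
    """Assumed measure: -log2 of the fraction of documents containing the term."""
    df = sum(1 for d in docs if term in d)
    if df == 0:
        return math.inf  # unseen terms are treated as maximally specific
    return -math.log2(df / len(docs))


def partition_terms(terms, docs, k):
    """Split terms into at most k partitions, ordered from general (low
    information content) to specific (high information content)."""
    ranked = sorted(terms, key=lambda t: (information_content(t, docs), t))
    size = math.ceil(len(ranked) / k)
    return [ranked[i:i + size] for i in range(0, len(ranked), size)]


def subsumes(parent, child, docs, threshold=0.8):
    """Assumed co-occurrence test: the parent appears in at least `threshold`
    of the documents that contain the child."""
    child_docs = [d for d in docs if child in d]
    if not child_docs:
        return False
    hits = sum(1 for d in child_docs if parent in d)
    return hits / len(child_docs) >= threshold


def build_ontology(terms, docs, k=2, threshold=0.8):
    """Return (parent, child) edges. Pairs are tested only across partitions,
    with the parent drawn from the more general partition, so no edge ever
    joins two terms of the same partition (the k-partite constraint)."""
    parts = partition_terms(terms, docs, k)
    edges = []
    for i, j in combinations(range(len(parts)), 2):
        for parent in parts[i]:
            for child in parts[j]:
                if subsumes(parent, child, docs, threshold):
                    edges.append((parent, child))
    return edges
```

Restricting the comparison to cross-partition pairs is what reduces the computation: terms inside the same partition are never compared with each other, and a pair is tested in one direction only.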


Similar resources

Public Transport Ontology for Passenger Information Retrieval

Passenger information aims at improving the user-friendliness of public transport systems while influencing passenger route choices to satisfy transit users' travel requirements. The integration of transit information from multiple agencies is a major challenge in the implementation of multi-modal passenger information systems. The problem of information sharing is further compounded by the multi-l...


Improving Classification of Multi-Lingual Web Documents using Domain Ontologies

In this paper, we deal with the problem of analyzing and classifying web documents into several major categories/classes in a given domain using a domain ontology. We present the ontology-based web content mining methodology that contains such main stages as collecting a training set of labeled documents from a given domain, building a classification model for this domain given the domain ontolog...


Using the Ontology Paradigm to Integrate Information Systems Oveia: Expanding the Topic Maps frontier

Ontology-based websites are one possible implementation of the Semantic Web. There are several languages for ontology specification: RDF, OWL, Topic Maps. Topic Maps follow a formally specified structure, which makes them a good choice for semantic website specification. The process of ontology development based on topic maps is complex and time-consuming, and it requires a lot of human and financia...


Improving the Search for Learning Objects with Keywords and Ontologies

We report on an ongoing project which aims at improving the effectiveness of retrieval and accessibility of learning objects within learning management systems and learning object repositories. The project Language Technology for eLearning approaches this task by providing Language Technology based functionalities and by integrating semantic knowledge through domain-specific ontologies. We will re...


Classification of Web Documents Using Concept Extraction from Ontologies

In this paper, we deal with the problem of analyzing and classifying web documents in a given domain by information filtering agents. We present the ontology-based web content mining methodology that contains such main stages as creation of ontology for the specified domain, collecting a training set of labeled documents, building a classification model in this domain using the constructed onto...




Publication date: 2012